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Abstract 

Complexion  plays  a  remarkably  important  role  in  recog¬ 
nition.  Experiments  with  human  subjects  have  shown  that 
complexion  provides  as  much  distinctiveness  as  other  well- 
known  features  such  as  the  shape  of  the  face.  From  the 
perspective  an  autonomous  robot,  changes  in  lighting  (e.g., 
intensity,  orientation )  and  camera  parameters  (e.g.,  white 
balance )  can  make  capturing  complexion  challenging.  In 
this  paper,  we  evaluate  complexion  as  a  soft  biometric  us¬ 
ing  color  (histograms)  and  texture  (local  binary  patterns). 
We  train  a  linear  SVM  to  distinguish  between  the  individual 
and  impostors.  We  demonstrate  the  performance  of  this  ap¬ 
proach  on  a  database  of  over  200  individuals  collected  to 
study  biometrics  in  human-robot  interaction.  In  our  exper¬ 
iment,  we  identify  9  individuals  that  interact  with  the  robot 
on  a  regular  basis,  rejecting  all  others  as  unknown. 

1.  Introduction 

An  autonomous  robot  must  be  able  to  learn  the  identities 
of  individuals  with  whom  it  interacts.  Identity  is  impor¬ 
tant  for  a  number  of  reasons.  For  example,  a  robot  should 
be  able  to  remember  preferences,  interaction  histories,  etc. 
When  identity  is  first  learned,  we  anticipate  the  person  to 
be  generally  cooperative,  but  may  be  speaking,  changing 
head  pose  and  facial  expression.  A  robot  may  be  expected 
to  operate  in  uncontrolled  environments,  where  the  light¬ 
ing  during  evaluation  may  not  be  the  same  as  it  was  during 
enrollment. 

Our  goal  is  for  autonomous  systems  to  be  able  to  meet 
and  remember  new  people,  in  a  way  that  is  much  like  hu¬ 
mans.  For  this  to  be  possible,  all  biometric  signatures  must 
have  a  short  enrollment  time,  preferably  real-time.  Enroll¬ 
ment  must  occur  completely  autonomously.  Biometric  sig¬ 
natures  should  be  invariant  to  changes  in  illumination  (e.g., 
outdoor,  indoor,  etc.).  Finally,  we  expect  both  a  high  level 
of  accuracy  and  permanence. 
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In  this  paper,  we  examine  the  use  of  complexion  as 
a  partial  solution  to  this  problem.  Complexion  is  a  soft 
biometric[4].  Soft  biometrics  lack  the  distinctiveness  and 
permanence  of  traditional  modalities  (e.g.,  face  recogni¬ 
tion,  fingerprint  recognition,  etc.).  However,  they  can  be 
a  powerful  way  of  identifying  people  when  combined  to¬ 
gether  [12,  13,  8,  2].  In  complexion,  we  measure  the  color 
appearance  of  the  head,  which  can  include  skin  color,  hair 
color,  facial  hair  color,  etc.  Complexion  has  nice  proper¬ 
ties  that  make  it  ideal  for  an  autonomous  robot.  Complex¬ 
ion  does  not  require  high  resolution,  is  not  greatly  affected 
by  changes  in  pose,  and  can  be  both  learned  and  evaluated 
quickly. 

Although  it  has  not  been  studied  extensively  as  a  soft 
biometric,  complexion  appears  to  be  one  of  the  more  impor¬ 
tant  cues  that  humans  use  for  recognition.  Sinha  et  al.  [14] 
demonstrated  that  pigmentation  cues  provide  as  much  dis¬ 
tinctiveness  as  shape  cues.  This  effect  becomes  even  more 
pronounced  when  dealing  with  faces  that  have  reduced  reso¬ 
lution,  which  can  obscure  shape.  Pascalis  and  Schonen  [10] 
demonstrated  that  the  contrast  between  hair  and  skin  is  one 
of  the  more  important  feature  used  by  neonates  to  recog¬ 
nize  their  mothers.  By  obscuring  the  hair,  they  effectively 
reduced  the  neonates  ability  to  distinguish  between  their 
mother  and  a  stranger.  For  neonates,  this  cue  is  even  more 
important  than  intrinsic  facial  features. 

The  major  challenge  is  providing  invariance  to  changes 
in  illumination  and  camera  parameters,  which  can  greatly 
affect  the  accuracy  of  identification.  Changes  in  illumina¬ 
tion  are  caused  by  differences  in  the  position  and  orienta¬ 
tion  of  the  light  source(s)  in  the  environment.  For  example, 
shadows  cast  on  the  face  can  make  the  complexion  seem 
darker  than  it  actually  is.  Prior  work  in  skin  detection  has 
demonstrated  that  the  Hue  /  Saturation  (HS)  color  space 
provides  a  measure  of  invariance  to  shadow.  Changes  in 
the  camera  parameters  can  affect  the  white  balance  of  the 
camera.  This  can  greatly  affect  the  measured  color  from 
the  cameras.  We  can  compensate  for  this  by  measuring  the 
observed  texture  on  the  face,  using  local  binary  patterns. 

In  this  paper,  we  capture  complexion  using  both  Hue- 
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Saturation  color  histograms  and  center- symmetric  local  bi¬ 
nary  patterns  (CS-LBP)  [11].  We  compare  the  performance 
of  each  approach,  showing  the  benefits  of  each  in  different 
situations. 


Figure  1 .  The  MDS  robot  used  in  this  experiment, 

Our  robotics  platform  is  an  Mobile-Dextrous-Social 
(MDS)  robotics  named  Octavia.  Octavia  is  equipped  with 
an  RGB-D  sensor  using  a  pair  of  color  cameras  in  her  eyes 
and  a  SwissRanger  SR-4000  camera  in  the  forehead.  The 
SR-4000  is  used  to  detect  individuals  in  the  environment,  so 
that  Octavia’s  eyes  (i.e.  RGB  cameras)  look  directly  people 
when  she  is  interacting  with  them. 

To  test  identification  in  a  human-robot  interaction  sce¬ 
nario,  we  designed  an  experiment  where  the  public  was 
asked  to  interact  with  Octavia  over  a  5  day  period.  We  had 
over  200  individuals  participate,  9  of  which  were  experi¬ 
menters  that  interacted  with  the  robot  on  a  regular  basis. 
We  use  complexion  to  identify  the  experimenters  that  inter¬ 
act  with  the  robot  regularly,  while  rejecting  everybody  else 
as  unfamiliar. 

The  remainder  of  the  paper  is  organized  as  follows.  We 
report  on  related  work  in  section  2.  We  introduce  methodol¬ 
ogy  in  section  3,  then  show  experimental  results  in  section 
4.  We  conclude  in  section  5  with  a  discussion  of  our  results 
and  future  directions. 

2.  Related  Work 

Complexion  is  closely  related  to  the  problem  of  detec¬ 
tion  skin  in  images.  Zarit  et  al.  [16]  studied  skin  detection 
in  five  different  color  spaces:  normalized  RGB,  YCrCb, 
HSV,  Fleck  HS,  and  CIE  LAB.  As  has  been  consistent 
with  other  more  recent  works  (e.g.,  [5]),  Zarit  found  that 
Hue/Saturation/ Value  (HSV)  tends  to  perform  slightly  bet¬ 
ter  than  or  comparably  to  many  other  metrics. 

One  important  question  is  invariance  to  lighting. 
Some  [9]  have  proposed  that  the  illuminance  (or  Value) 


channel  is  most  affected  by  shadow,  and  that  removing  this 
component  can  increase  performance.  However,  Khan  et 
al.  [5]  demonstrated  that  while  this  may  be  true,  remov¬ 
ing  the  illuminance  channel  can  also  decrease  skin  detection 
performance.  From  the  perspective  of  complexion  as  a  soft 
biometric,  this  introduces  an  interesting  trade-off.  Without 
this  component  (i.e.,  using  Hue/Saturation  -  HS),  we  may 
be  invariant  to  illumination  conditions,  however,  with  this 
component  accuracy  may  increase.  In  our  case,  lighting  in¬ 
variance  is  of  critical  importance,  so  we  choose  to  use  the 
HS  color  space.  When  the  lighting  is  controlled,  HSV  may 
be  more  appropriate. 

Complexion  as  a  soft  biometric  is  also  closely  related 
to  color  based  object  recognition  [6],  and  detection  [15]. 
Recent  works  by  Zhu  et  al  [17]  have  suggested  the  use  of 
local  binary  patterns  to  handle  changes  in  lighting.  LBPs 
are  computed  using  a  difference  in  values  from  the  center 
pixel,  which  implicitly  handles  a  situation  of  a  linear  change 
in  color  intensity  [17].  However,  CS-LBPs  alone  do  not 
describe  the  color  of  the  object,  they  simple  describe  the 
relative  difference  between  color  pixels.  While  this  makes 
it  an  effective  way  of  describing  texture,  its  not  appropriate 
to  distinguish  between  different  colored  objects  that  may, 
perhaps  have  a  similar  texture. 

Local  binary  patterns  can  also  be  used  to  recognize 
faces  [1].  In  this  work,  the  authors  explored  the  tradeoff 
between  the  radius  and  sampling  size  of  the  LBP  as  well  as 
the  size  of  the  extracted  patches.  As  noted  by  the  authors, 
as  the  sizes  of  the  patches  increase,  the  spatial  information 
becomes  increasingly  lost.  While  this  is  a  negative  for  face 
recognition,  it  may  be  a  positive  for  the  use  of  LBP  as  a  soft 
biometric.  By  using  the  face  as  a  whole,  we  may  be  able  to 
capture  more  holistic  features  capturing  information  such 
as  the  general  shape  of  the  face  and  the  contrast  between 
features. 

Some  prior  works  have  addressed  the  question  of  com¬ 
plexion  as  a  soft  biometric  from  the  point  of  view  of  eth¬ 
nicity  [7,  3]  or  using  categorical  labels  (e.g.,  light  or  dark 
skinned)  [2] .  While  these  works  have  shown  promise,  they 
remove  a  great  deal  of  distinctive  information  that  is  present 
due  t  o  the  ranges  of  skin  color  that  are  possible  within  eth¬ 
nic  groups. 

3.  Methodology 

In  this  section,  we  discuss  our  approach  to  modeling 
complexion  using  both  color  (section  3.3)  and  texture  (sec¬ 
tion  3.2).  However,  we  begin  with  a  brief  overview  of  the 
problem  of  modeling  color  and  the  challenges  of  color  in 
the  presence  of  illumination  and  shadow. 

3.1.  Color  Invariance 

The  images  in  figure  2  illustrate  a  typical  problem:  vari¬ 
ation  in  illumination  conditions.  In  some  cases,  parts  of  the 


face  are  cast  in  shadow,  in  other  cases  specular  reflections 
are  caused  by  different  lighting  conditions.  In  this  section, 
we  review  color  invariance  as  described  by  Van  De  Sande 
etal.  [15]. 


Figure  2.  Complexion  can  vary  greatly  due  to  different  illumina¬ 
tion  conditions.  The  figure  above  shows  the  same  individuals  in 
different  lighting  conditions.  Some  lighting  changes  were  due  to 
specular  reflections  (right);  others  were  due  to  shadow  (left). 

Changes  in  color  values  can  be  classified  into  6  different 
categories.  The  first  category  (light  intensity  change)  can 
model  the  effect  of  shadows  by  providing  a  linear  scaling  of 
each  color  channel. 
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The  second  category  (light  intensity  shift)  models  the  in¬ 
tensity  at  which  light  appears  at  a  pixel,  which  can  model 
specular  reflections. 


The  third  category  (light  intensity  change  +  light  inten¬ 
sity  shift)  models  both  together: 
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Note  that  in  Eq.  1-3  each  channel  is  scaled  by  the 
same  factor,  a  or  adjusted  by  the  same  amount  o\.  Scaling 
each  channel  by  different  factors  a,  6,  c  and/or  adjusting  by 
different  amounts  oi,  02, 03  can  adjust  different  channels  in 
different  ways.  For  example,  the  red  channel  can  be  scaled 
by  a  =  1.2,  whereas  the  other  channels  are  not  scaled  at 
all  (b  =  c  =  1.0).  Such  situations  are  common  in  white 


balancing,  where  the  color  space  can  be  slightly  different  in 
time  1  and  time  2. 

From  figure  2  it  is  clear  that  both  shadow  and  specu¬ 
lar  reflection  are  common  problems  ,  but  can  be  adequately 
described  by  eq.  1-3.  When  images  are  taken  at  different 
points  of  time,  it  is  possible  that  the  camera  will  be  white 
balanced  differently,  which  will  cause  the  color  channels  to 
vary  indepedently. 

The  RGB  color  space  is  not  invariant  to  any  scaling  or 
adjusting  factor,  which  makes  it  unsuitable  for  modeling 
complexion.  Prior  work  ( [15])  has  shown  that  the  HS  color 
space  is  tolerant  to  light  intensity  change,  light  intensity  sift, 
and  a  combination  of  the  three  (i.e.,  eq.  1-3). 

Invariance  to  adjusting  and  scaling  by  different  factors 
is  much  more  difficult.  On  potential  solution  is  the  center 
symmetric  local  binary  pattern  (CS-FBP)  [17,  11].  FBPs 
are  computed  by  comparing  the  difference  between  a  pixel 
values.  Scaling  by  the  same  factor  does  not  impact  this  de¬ 
cision. 

3.2.  Local  Binary  Patterns 

Focal  binary  patterns  [11]  are  a  flexible  and  effective 
way  of  describing  the  texture.  The  flexibility  of  FBPs  are 
due  to  the  various  ways  in  they  can  be  formulated. 
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Figure  3.  Local  binary  patterns  are  computed  by  comparing  the 
center  pixel  (white  circle)  to  surrounding  pixels  (black  circle). 

The  canonical  local  binary  pattern  (see  Figure  3)  is  com¬ 
puted  by  comparing  the  value  of  the  center  pixel  to  the  sur¬ 
rounding  pixels.  This  comparison  results  in  an  8  bit  num¬ 
ber,  where  each  comparison  results  in  a  1  if  the  surrounding 
(black)  pixel  is  greater  than  the  center  (white)  pixel.  In¬ 
tuitively,  each  number  represents  a  different  kind  of  tex¬ 
ture.  FBP=0  when  there  is  a  flat  region  of  no  texture, 
FBP=255  when  there  is  a  spot,  other  numbers  represent  cor¬ 
ners,  edges,  etc. 

The  basic  FBP  used  a  radius  of  1,  it  is  possible  to  ex¬ 
tend  this  out  by  comparing  values  at  other  radii.  For  exam¬ 
ple,  when  radius=2,  we  look  at  neighbors  that  are  2  pixels 
away.  This  can  result  in  either  a  16-bit  or  an  8-bit  number. 
However,  we  use  an  8-bit  number  by  interpolating  between 
values  when  a  comparison  falls  between  several  possible 
pixels. 

To  summarize  the  texture  of  a  region,  we  accumulate  val¬ 
ues  into  a  histogram,  where  each  bin  describes  a  different 


kind  of  texture.  By  looking  at  smaller  regions,  we  can  find 
a  more  local  description  of  texture.  In  our  experiments,  we 
divide  the  face  into  the  top  and  bottom,  summarizing  the 
textures  present  in  each  separately. 

Hue- Saturation  local  binary  patterns  (HS-LBP)  are  com¬ 
puted  by  finding  the  local  binary  pattern  on  each  channel 
independently,  then  concatenating  the  resulting  histograms. 
In  [1]  a  similar  approach  was  used  for  face  recognition, 
where  it  was  shown  experimentally  that  radius=2  sampling 
along  8  different  bits  provides  a  great  deal  of  effectiveness. 

3.3.  Histograms 

We  detect  faces  using  the  Pittsburgh  Pattern  Recognition 
SDK.  We  resize  the  detected  faces  to  a  normalized  size  of 
10  pixels  interocular.  Typical  examples  are  shown  in  fig¬ 
ure  4.  We  compute  the  color  histograms  for  the  Hue,  Satu¬ 
ration  channels  separately,  using  10  bins  for  each  channel. 
We  concatenate  the  two  histograms  resulting  in  20  different 
color  features  for  each  image. 


Figure  4.  Low  resolution  images  are  used  to  learn  the  complexion 
of  participants. 


3.4.  Classification 

For  both  the  purposes  of  efficiency  and  classification  ac¬ 
curacy,  we  use  a  linear  support  vector  machine  to  identify 
individuals  using  the  features  obtained  either  through  color 
histograms  or  through  color  local  binary  patterns. 

Given  the  set  of  features  Xi  and  the  associated  label  a 
two  class  problem  is  solved  using  a  linear  discriminant  a  to 
project  a  pattern  x 

g(y)  =  alx  (4) 

An  unknown  pattern  will  be  classified  using  a,  where  the 
resulting  prediction  is  based  on  the  the  sign  of  g.  Data  is 
usually  situated  in  a  manner  where  there  are  a  great  many 
possible  hyperplanes  that  will  solve  the  problem  in  the  man¬ 
ner  outlined  above.  The  most  obvious  solution  may  be  a 
pair  of  hyperplanes,  one  of  which  lies  on  the  positive  deci¬ 
sion  boundary,  the  other  on  the  negative  decision  boundary. 
The  margin  is  the  distance  between  these  two  hyperplanes. 
The  most  general  solution  will  be  the  one  that  maximizes 
this  margin 


(5) 
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maximizing  b ,  subject  to  b\  \a\  |  =  1.  Support  vectors  will 
be  those  points  that  lie  on  the  boundary  with  the  maximum 
margin.  These  represent  the  patterns  that  are  most  difficult 
to  classify,  the  removal  of  these  points  from  the  training  set 
will  result  in  a  different  margin. 

It  is  common  to  make  use  of  a  kernel  function  in  cases 
where  the  data  is  not  linearly  separable.  In  this  case,  rather 
than  comparing  two  patterns  Xi  and  xj  directly,  we  compare 
k  (xi  ,Xj),  where  the  kernel  k  generally  is  a  polynomial  or  an 
RBF  kernel.  This  may  result  in  an  improved  classification 
accuracy.  However,  the  desire  for  efficient  classification,  we 
limit  experimentation  to  a  linear  kernel. 

4.  Experimental  Results 

To  evaluate  biometrics  for  human-robot  interaction,  we 
designed  a  study  in  the  form  of  a  “game”  where  three  indi¬ 
viduals  have  a  short  interaction  with  the  robot.  The  game  is 
divided  into  two  separate  phases:  a  training  phase,  and  an 
evaluation  phase. 

In  the  first  phase,  the  robot  needed  to  acquire  a  model 
and  an  identifier  for  each  individual.  For  each  of  three  pre¬ 
specified  locations,  she  rotated  her  head  and  torso  to  center 
that  location  in  the  image.  Then  she  looked  straight,  up, 
and  then  down,  trying  each  position  until  a  face  was  found 
in  the  SR4000  intensity  image.  After  detecting  the  indi¬ 
vidual,  the  robot  readjusted  the  necessary  motors  to  place 
the  individual  at  the  top  of  the  image  in  both  the  RGB  and 
depth  cameras.  Additional  adjustments  to  pose  were  made 
throughout  the  entire  game  whenever  pose  varied  by  at  least 
0.1  rad. 

After  detecting  the  individual  and  adjusting  pose,  the 
robot  first  asked  for  an  identifier.  This  was  done  as  a  ques¬ 
tion  about  the  volunteers  favorite  flavor  of  ice  cream,  color, 
animal,  planet,  or  sport.  The  answer  provided  by  the  vol¬ 
unteer  was  manually  entered  by  the  experimenter  into  the 
person-id  system,  and  was  attached  to  all  subsequently  col¬ 
lected  data  and  biometric  models.  When  the  experimenters 
participated,  their  names  were  used  to  preserve  consistency. 

With  an  established  identifier,  the  robot  asked  the  volun¬ 
teer  to  speak  on  a  subject  for  up  to  30-sec.  Subjects  were 
randomly  selected  from  one  of  5  topics.  The  robot  recorded 
20- sec  of  data  after  asking  the  volunteer  about  a  subject. 
Although  participants  were  told  beforehand  that  they  could 
speak  for  as  long  or  as  little  as  they  would  like,  most  par¬ 
ticipants  spoke  only  a  sentence  in  response.  Despite  the 
limited  utterance  length,  20-sec  was  still  necessary  to  guar¬ 
antee  participants  looked  at  the  robot  long  enough  to  build 
a  model.  Often,  they  would  look  around  while  the  robot 
was  building  models,  limiting  the  number  of  full-on  frontal 
views  of  the  face. 


Once  the  training  phase  was  completed  for  all  three  vol¬ 
unteers,  the  robot  said,  Now  I  will  close  my  eyes  and  people 
should  move  around.  Then  I  will  try  to  recognize  each  of 
you.  She  then  closed  her  eyes  and  slowly  counted  to  five. 
People  were  told  previously  that  they  could  either  move  or 
stay  where  they  were  to  try  and  fool  the  robot,  but  they  over¬ 
whelmingly  choose  to  change  positions.  On  reaching  the 
number  five,  the  robot  said,  Ready  or  not,  here  I  come,  and 
opened  her  eyes. 

In  the  final  phase,  the  robot  again  turned  to  face  each 
of  the  locations  in  turn  and  tried  to  recognize  the  individ¬ 
ual.  As  during  training,  the  robot  had  to  find  the  individuals 
face  first  by  looking  up  and  down,  and  then  continuously 
updated  her  viewing  angle  to  keep  the  face  towards  the  top 
of  the  image  in  both  cameras.  Once  they  were  detected,  she 
said,  I’m  sure  I  can  do  this,  but  maybe  you  could  give  me 
a  little  hint  about  your  identity?  Because  participants  were 
warned  ahead  of  time  that  this  was  optional,  most  simply 
stated,  No  or  shook  their  heads  in  the  negative.  The  pur¬ 
pose  of  asking  for  a  hint  was  to  encourage  the  subjects  to 
provide  facial  expression  and/or  speech  data,  and  was  not 
used  during  the  identification  process. 

After  asking  for  a  hint,  the  robot  again  looked  at  the  sub¬ 
ject  for  20-sec,  using  all  images  where  a  face  was  detected 
to  match  the  biometric  signatures.  At  the  end,  she  guessed 
the  participants  identity  from  the  3  possible  subjects  based 
on  maximum  likelihood.  We  have  reported  the  results  from 
this  “shell  game”  in  [8],  where  we  made  use  of  multiple  soft 
biometrics  to  identify  individuals.  In  this  work,  we  limit  the 
analysis  to  complexion  information. 

We  evaluate  open-world  identification  of  9  experi¬ 
menters  using  complexion  information  alone.  For  the  pur¬ 
poses  of  evaluation,  we  split  those  sessions  that  do  not  be¬ 
long  to  the  experimenters  into  5  different  folds.  At  each 
step,  one  fold  is  used  to  train  the  classifier,  the  remaining 
four  are  used  for  evaluation. 

Experimental  results  are  shown  the  ROC  curve  in  fig¬ 
ure  5.  It’s  interesting  to  note  the  overall  performance  of 
complexion,  as  well  as  the  fact  that  neither  approach  domi¬ 
nates  the  other. 

4.1.  Results  in  Transformed  Color  Space 

To  illustrate  the  problems  introduced  by  different  color 
balancing  algorithms,  we  synthetically  create  a  second  data 
set,  where  the  red  channel  in  the  test  images  are  scaled  by  a 
factor  of  1.2,  while  keeping  the  blue  and  green  channels  the 
same.  The  results  of  this  experiment  are  in  figure  6. 

This  produces  some  dramatically  different  results,  but 
shows  the  effectiveness  of  LBPs. 

We  can  draw  several  conclusions  from  these  two  exper¬ 
iments.  First,  in  cases  where  color  camera  calibrations  do 
not  change,  HS  color  histograms  are  an  effective  tool.  How¬ 
ever,  over  the  long  term,  CS-LBPs  are  likely  to  be  the  more 


Figure  5.  ROC  showing  performance  of  complexion  using  soft  bio¬ 
metrics.  In  this  experiment,  we  evaluate  complexion  by  comparing 
the  subject  against  191  non-subjects  using  5-fold  cross-validation. 


Figure  6.  ROC  showing  performance  of  complexion  using  soft  bio¬ 
metrics.  In  this  experiment,  we  evaluate  complexion  by  comparing 
the  subject  against  191  non-subjects  using  5-fold  cross-validation. 


robust  approach. 

5.  Discussion 

We’ve  shown  in  this  paper  the  effectiveness  of  color  for 
complexion  analysis.  The  strength  of  the  HS-color  his¬ 
togram  lies  in  it’s  ability  to  distinguish  between  people  in 
the  same  color  space.  When  the  color  parameters  are  ad¬ 
justed,  it  loses  this  ability.  The  CS-LBPs  in  this  case  per¬ 
form  quite  well. 

Complexion  has  a  number  of  advantages,  which  we  in¬ 
tend  to  fully  investigate  in  future  works.  Among  these 
advantages  are  the  ability  to  recognize  individuals  in  low 
resolution  images.  This  provides  the  ability  to  recognize 
subjects  at  a  distance  in  combination  with  other  modalities 
(e.g.,  gait). 

The  biggest  issue  that  we  have  observed  is  related  to  the 
color  space  changes  produced  by  white  balancing.  This  step 
has  to  be  performed  carefully,  and  ad-hoc  approaches  to 


white  balancing  may  not  be  sufficient. 
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